CDS

Accession Number TCMCG022C41458
gbkey CDS
Protein Id XP_039170811.1
Location join(8949528..8949702,8949788..8949870,8950186..8950234,8950237..8950279,8950699..8950802,8951148..8951317,8951419..8951533,8951932..8951987,8952076..8952112,8952298..8952398,8952777..8952839,8952922..8952984,8953102..8953184,8953359..8953467,8953567..8953620,8954270..8954357,8954731..8954768,8955329..8955434,8955628..8955794,8955918..8955977)
Gene LOC104448212
GeneID 104448212
Organism Eucalyptus grandis
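
The Location field above is a join of 1-based, inclusive coordinate ranges. As a minimal sketch (the helper names `parse_join` and `segment_lengths` are illustrative and not part of any database tool, and the regular expression assumes a simple forward-strand join without `complement(...)` or partial markers such as `<` and `>`), the segments can be parsed and summed like this:

```python
import re

# Location string copied verbatim from the record's Location field.
LOCATION = (
    "join(8949528..8949702,8949788..8949870,8950186..8950234,"
    "8950237..8950279,8950699..8950802,8951148..8951317,"
    "8951419..8951533,8951932..8951987,8952076..8952112,"
    "8952298..8952398,8952777..8952839,8952922..8952984,"
    "8953102..8953184,8953359..8953467,8953567..8953620,"
    "8954270..8954357,8954731..8954768,8955329..8955434,"
    "8955628..8955794,8955918..8955977)"
)


def parse_join(location):
    """Return (start, end) pairs, 1-based inclusive, from a simple join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def segment_lengths(segments):
    """Length of each segment; both endpoints are counted."""
    return [end - start + 1 for start, end in segments]


segments = parse_join(LOCATION)
lengths = segment_lengths(segments)
total = sum(lengths)

# Number of bases lying between each pair of consecutive segments.
gaps = [nxt[0] - cur[1] - 1 for cur, nxt in zip(segments, segments[1:])]

print(len(segments), "segments,", total, "nt in total")
print("shortest gap between segments:", min(gaps), "nt")
```

For this record the sketch reports 20 segments totalling 1764 nt, which equals 3 × 588, the protein length listed below. The shortest gap is 2 nt, between the third and fourth segments; the code only reports that gap and does not interpret it.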

Protein

Length 588aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA698663
db_source XM_039314877.1
Definition LOW QUALITY PROTEIN: imidazole glycerol phosphate synthase hisHF, chloroplastic [Eucalyptus grandis]

EGGNOG-MAPPER Annotation

COG_category E
Description belongs to the HisA HisF family
KEGG_TC -
KEGG_Module M00026
KEGG_Reaction R04558
KEGG_rclass RC00010, RC01190, RC01943
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K01663
EC -
KEGG_Pathway ko00340, ko01100, ko01110, ko01230, map00340, map01100, map01110, map01230
GOs -

Sequence

CDS:  
ATGGAGGCGGCGCCGTTCGGCTCGGCATTCACATCTAGATCGCTCTCCCCACCCTCTTCATCGTCTTCCACCTCGCAATCCCTCCTTTGCTTCCATTTCAACAAGGCCCGCCTCAAGCTCAAATCGCCCAGAACCTTCGCCGTTCGCGCCGCCGCCGCTCGGGGAGGAGATTCTGTGGTGACTCTGCTTGATTATGGTGCCGGCAACGTCCGTAGCGTCCGGAATGCTATTCGCCACCTCGGCTTCGACATTAAAGATGTGCAAACTCCTGAAGATATTTTGGGTGCAAATCGCCTCATCTTCCCGGGTGGGGCATTTGCTTCTGCCATGGATGTGTTGAATAAGAAAGGGATGGCTGAAGCTCTCTGCACTTATATTATGGAAGATCGCCCCTTCCTAGGCATATGTCTTGGACTGCAACTACTCTTTGAGTCAAGTGAGGAGAATGGCCCAGTCCGTGGTCTTGGAATAATCCCTGGAGTGGTTGGTCGCTTTGATGCATCCAGTGGTTTGAGAGTGCCTCACATTGGCTGGAATGCCTTGAAAGTTAGAGAAGGCTCTGAAATTTTGGATGATATTGGAAATCACCATGTCTATTTTGTTCACTCCTACCGTGCCCTACCAACAAACGACAACATGGATTGGGTTTCCTCTACCTGCAATTATGGCGACAATTTCATAGCCTCTGTAGGGAAGGGAAATGTGCATGCAGTACAATTTCACCCAGAAAAAAGTGGAGATGTGGGTCTGACGGTATTGAGAAGATTCTTGTCGCCAAAGTCACATGTAACAGAGAAGCCTACAGAAGGGAATGCCTCAAAGCTTGCTAAAAAGGTAATTGCTTGTCTTGATGTGAGGGCAAACGACAAGGGAGATCTTGTTGTTACCAAAGGCGACCAATATGATGTGAGAGAGCGCACAGAAGAGAATGAGGTGAGGAACCTCGGTAAGCCTGTGGACCTAGCTGGGCAGTATTATAAGGATGGAGCAGATGAAGTCAGTTTTCTGAATATTACTGGTTTCCGCGACTTCCCCCTGGGTGACTTACCGATGCTGCAGGTATTGAGATACACATCAGAAAATGTTTTTGTACCACTAACTGTTGGAGGTGGAATTAGAGATTTCACGGACGCAAATGGCAGGTACTATTCCAGTCTAGAAGTGGCTTCAGAATATTTTAGATCTGGGGCGGATAAGGTTTCTATAGGCAGTGATGCGGTTTATGCTGCTGAAGAATATCTAAGAACCGGAGTGAAGAGTGGAAAGAGCAGCTTAGAACAGATATCTAGAGTCTATGGGAATCAGGCAGTGGTAGTGAGCATAGATCCTCGTAGGATGTACATCAAGAGTCCCGAAGATGTGGAGTTCAGATCTACAAGGGTAACAAATCCAGGTCCAAATGGAGAAGAATATGCTTGGTACCAGTGCACGGTCAATGGAGGACGAGAAGGTCGGCCAATTGGAGCTTACGAGCTTGCAAAGGCTGTTGAAGATTTGGGTGCTGGAGAGATATTGCTTAACTGCATTGACTGTGATGGTCAAGGAAAAGGATTCGATGTCGATTTAGTGAAGCTGATTTCTGATGCCGTGAGCATCCCAGTGATAGCAAGTAGCGGTGCGGGCTGTGTGGAGCACTTTACGGAGGTATTTGAGAAGACCAATGCATCCGCTGCGCTTGCTGCAGGGATATTCCACCGGAAGGAGGTGCCGATTCAGGCTGTGAAGGAGCACTTGCTAAAGGAAGGCATAGAAGTAAGAATCTAG
Protein:  
MEAAPFGSAFTSRSLSPPSSSSSTSQSLLCFHFNKARLKLKSPRTFAVRAAAARGGDSVVTLLDYGAGNVRSVRNAIRHLGFDIKDVQTPEDILGANRLIFPGXGAFASAMDVLNKKGMAEALCTYIMEDRPFLGICLGLQLLFESSEENGPVRGLGIIPGVVGRFDASSGLRVPHIGWNALKVREGSEILDDIGNHHVYFVHSYRALPTNDNMDWVSSTCNYGDNFIASVGKGNVHAVQFHPEKSGDVGLTVLRRFLSPKSHVTEKPTEGNASKLAKKVIACLDVRANDKGDLVVTKGDQYDVRERTEENEVRNLGKPVDLAGQYYKDGADEVSFLNITGFRDFPLGDLPMLQVLRYTSENVFVPLTVGGGIRDFTDANGRYYSSLEVASEYFRSGADKVSIGSDAVYAAEEYLRTGVKSGKSSLEQISRVYGNQAVVVSIDPRRMYIKSPEDVEFRSTRVTNPGPNGEEYAWYQCTVNGGREGRPIGAYELAKAVEDLGAGEILLNCIDCDGQGKGFDVDLVKLISDAVSIPVIASSGAGCVEHFTEVFEKTNASAALAAGIFHRKEVPIQAVKEHLLKEGIEVRI